State generalization method with support vector machines in reinforcement learning

نویسندگان

  • Ryo Goto
  • Hiroshi Matsuo
چکیده

The conventional reinforcement learning assumes discrete state space. Therefore, it is necessary to make states discrete in order to handle continuous state environments. However, if a simple discretization is applied, the number of states increases exponentially with the dimension of the state space, and the learning time increases. In this paper, we propose a state generalization that is able to quickly adapt to environments by using Support Vector Machines. We conducted an experiment on the simulation task that navigates a robot to a goal. As a result of the experiment, the proposed method adapted to environment by a few trials.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

State Generalization with Support Vector Machines in Reinforcement Learning

The conventional reinforcement learning assumes discrete state space. Therefore, it is necessary to make states discrete manually in order to handle continuous state environments. However, if the simple discretization is applied, the number of states increases exponentially with the dimension of the state space, and the learning time increase. In this paper, we propose a state generalization th...

متن کامل

A QUADRATIC MARGIN-BASED MODEL FOR WEIGHTING FUZZY CLASSIFICATION RULES INSPIRED BY SUPPORT VECTOR MACHINES

Recently, tuning the weights of the rules in Fuzzy Rule-Base Classification Systems is researched in order to improve the accuracy of classification. In this paper, a margin-based optimization model, inspired by Support Vector Machine classifiers, is proposed to compute these fuzzy rule weights. This approach not only  considers both accuracy and generalization criteria in a single objective fu...

متن کامل

On-line Support Vector Machines for Function Approximation

This paper describes an on-line method for building ε-insensitive support vector machines for regression as described in (Vapnik, 1995). The method is an extension of the method developed by (Cauwenberghs & Poggio, 2000) for building incremental support vector machines for classification. Machines obtained by using this approach are equivalent to the ones obtained by applying exact methods like...

متن کامل

A Comparative Study of Extreme Learning Machines and Support Vector Machines in Prediction of Sediment Transport in Open Channels

The limiting velocity in open channels to prevent long-term sedimentation is predicted in this paper using a powerful soft computing technique known as Extreme Learning Machines (ELM). The ELM is a single Layer Feed-forward Neural Network (SLFNN) with a high level of training speed. The dimensionless parameter of limiting velocity which is known as the densimetric Froude number (Fr) is predicte...

متن کامل

Modeling of Reinforcement in Concrete Beams Using Machine Learning Tools

The paper discusses the results obtained to predict reinforcement in singly reinforced beam using Neural Net (NN), Support Vector Machines (SVM’s) and Tree Based Models. Major advantage of SVM’s over NN is of minimizing a bound on the generalization error of model rather than minimizing a bound on mean square error over the data set as done in NN. Tree Based approach divides the problem into a ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Systems and Computers in Japan

دوره 37  شماره 

صفحات  -

تاریخ انتشار 2006